Text To Face Generation


Text-to-face generation is the process of generating images of faces from textual descriptions using deep learning techniques.

MOVi: Training-free Text-conditioned Multi-Object Video Generation

Add code
May 29, 2025
Viaarxiv icon

DisTime: Distribution-based Time Representation for Video Large Language Models

Add code
May 30, 2025
Viaarxiv icon

One-Way Ticket:Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

Add code
May 28, 2025
Viaarxiv icon

Facial Attribute Based Text Guided Face Anonymization

Add code
May 27, 2025
Viaarxiv icon

AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation

Add code
May 28, 2025
Viaarxiv icon

PreGenie: An Agentic Framework for High-quality Visual Presentation Generation

Add code
May 27, 2025
Viaarxiv icon

TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models

Add code
May 29, 2025
Viaarxiv icon

Structured Memory Mechanisms for Stable Context Representation in Large Language Models

Add code
May 28, 2025
Viaarxiv icon

TokBench: Evaluating Your Visual Tokenizer before Visual Generation

Add code
May 26, 2025
Viaarxiv icon

Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals

Add code
May 27, 2025
Viaarxiv icon